Multiclass Classification of Unconstrained Handwritten Arabic Words Using Machine Learning Approaches
Authors
Abstract
In this paper, we propose and describe an efficient approach to multiclass classification and recognition of unconstrained handwritten Arabic words using two machine learning methods: the K-nearest neighbor (K-NN) classifier and the neural network (NN). The technical details are presented in three stages, namely preprocessing, feature extraction and classification. First, words are segmented from the input scripts and normalized in size. Second, various features are extracted from each segmented word. Finally, these features are used to train the K-NN and NN classifiers. To validate the proposed techniques, extensive experiments are conducted with both classifiers on the IFN/ENIT database, which contains 32492 Arabic words; the proposed algorithms give good accuracy when compared with other methods.
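The three stages described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the features or the value of K, so the zoning (ink-density) features, the 8x8 normalized size, K = 3, and the toy bar images standing in for word images are all assumptions made for illustration.

```python
import math
from collections import Counter

def normalize_size(img, out_h=8, out_w=8):
    """Preprocessing: resize a binary image (rows of 0/1) by nearest-neighbour sampling."""
    in_h, in_w = len(img), len(img[0])
    return [[img[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
            for r in range(out_h)]

def zoning_features(img, zones=4):
    """Feature extraction: ink density in each cell of a zones x zones grid.
    Zoning is an assumed feature choice, not one taken from the paper."""
    zh, zw = len(img) // zones, len(img[0]) // zones
    feats = []
    for zr in range(zones):
        for zc in range(zones):
            cell = [img[r][c]
                    for r in range(zr * zh, (zr + 1) * zh)
                    for c in range(zc * zw, (zc + 1) * zw)]
            feats.append(sum(cell) / len(cell))
    return feats

def knn_predict(train_X, train_y, x, k=3):
    """Classification: majority vote among the k nearest training vectors (Euclidean)."""
    dists = sorted(
        (math.sqrt(sum((a - b) ** 2 for a, b in zip(t, x))), label)
        for t, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]

def make_bar(h, w, horizontal):
    """Toy stand-in for a segmented word image: a bar across the middle third."""
    if horizontal:
        return [[1 if h // 3 <= r < 2 * h // 3 else 0 for _ in range(w)] for r in range(h)]
    return [[1 if w // 3 <= c < 2 * w // 3 else 0 for c in range(w)] for _ in range(h)]

def extract(img):
    """Run preprocessing and feature extraction on one image."""
    return zoning_features(normalize_size(img))

# Tiny hypothetical training set: two classes, images of different sizes.
train_imgs = [make_bar(16, 16, True), make_bar(12, 20, True),
              make_bar(16, 16, False), make_bar(12, 20, False)]
train_y = ["horizontal", "horizontal", "vertical", "vertical"]
train_X = [extract(img) for img in train_imgs]

def classify(img):
    """Full pipeline for one unseen image."""
    return knn_predict(train_X, train_y, extract(img), k=3)

print(classify(make_bar(24, 24, True)))
print(classify(make_bar(24, 24, False)))
```

Size normalization is what lets images of different dimensions share one fixed-length feature vector, which both a K-NN and a neural network classifier require. A real system would replace the toy images with segmented word images and would likely use richer features.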
Similar resources
Word based off-line handwritten Arabic classification and recognition : design of automatic recognition system for large vocabulary offline handwritten Arabic words using machine learning approaches
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
Automatic recognition of printed texts and manuscripts facilitates entering data into the computer and digitizing it, and is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
Evaluation of Ensemble Classifiers for Handwriting Recognition
One of the major developments in machine learning in the past decade is the ensemble method, which finds a highly accurate classifier by combining many moderately accurate component classifiers. In this research work, new ensemble classification methods are proposed for homogeneous ensemble classifiers using bagging and heterogeneous ensemble classifiers using arcing classifier and their performa...
Performance of hidden Markov model and dynamic Bayesian network classifiers on handwritten Arabic word recognition
This paper presents a comparative study of two machine learning techniques for recognizing handwritten Arabic words, in which hidden Markov models (HMMs) and dynamic Bayesian networks (DBNs) are evaluated. The proposed work is divided into three stages, namely preprocessing, feature extraction and classification. Preprocessing includes baseline estimation and normalization as well as segmentation...
Word-Based Handwritten Arabic Scripts Recognition Using Dynamic Bayesian Network
In this paper, a multi-class classification system for handwritten Arabic words using a Dynamic Bayesian Network (DBN) is proposed, in which technical details are presented in terms of three stages, i.e. preprocessing, feature extraction and classification. Firstly, words are segmented from input scripts and also normalized in size. Then, features are extracted from each normalized word, where...